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THE MEANING OF STUDENT MARKS 

J. B. CANNING 

University of Chicago 



It is the purpose of a marking system to evaluate the academic 
work done by students in the courses of the various curricula. 
The marks themselves may consist of a series of letters or, prefer- 
ably, of numbers arranged in descending scale ; the most meritorious 
work is designated by a character at one end of the scale and the 
poorest, failure, by the character at the other. On the basis of 
marks given are awarded academic honors, additional credit 
toward graduation, prizes, etc.; and continued membership in 
a class or in a school may be conditioned upon maintaining some 
predetermined standard. The marks given may be expected to 
incite students to better work. Fairness in the award of honors, 
justice in determining upon failures and dismissals, and incitement 
of the student to better work can be attained only to the extent 
to which a common standard for the awarding of marks is under- 
stood, accepted, and acted upon. 

In many problems of evaluation two kinds of standards, positive 
and relative, are possible. A positive standard implies the possibil- 
ity of defining exactly some unit of measure, a foot, a pound, a 
calorie, etc., that never varies and that can be recognized and identi- 
fied. To evaluate a magnitude by a positive standard is to discover 
how many times the unit is contained in it, e.g., the number of feet 
in a coil of rope, the number of calories in a liter of alcohol, etc. 
Relative standards imply no rigid unit. Magnitudes or qualities 
to be evaluated need only be ranked from greatest to least, or from 
worst to best. A series of red cards may be ranked in order of 
brightness to the eye; or a group of men may be ranked according 
to stature from tallest to shortest without resort to any standard 
unit of measure; and if only rank in stature is wanted no positive 
unit of measure is needed. Appreciable differences of degree may 
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always be given relative appraisal, but they cannot always be 
measured positively. 

Rigidly to define a positive standard of excellence in academic 
work is impossible; for excellence in academic work is not a single 
variable but a complex of variables; hence there can be no positive 
unit of measurement. Nor are the variables themselves — logical 
thinking, previous training, interest in the work, etc. — simple 
variables subject to exact observation, measurement, and record. 
A theme written in a course in English composition cannot be 
ideally perfect nor quite ideally bad; nor can it be said of any given 
theme that it falls at a definite point on a graduated scale between 
these two extremes. To what extent an apt phrase should be 
allowed to compensate a grammatical irregularity is not a thing 
to be expressed quantitatively. Inability to add infallibly does 
not preclude all possibility of doing excellent work in plane geome- 
try. No doubt the illustrations cited will suggest a multitude of 
others to any teacher who has ever given serious thought to the 
meaning of marks in his own classes and in his own subject. Some, 
perhaps, may feel that the obstacles encountered there are not 
insurmountable, but will anyone have the hardihood to maintain 
that the relative badness of a comma blunder and of a misplaced 
decimal point, of a misspelled word and of an omitted digit, can 
be exactly determined ? Is one twice as bad as the other, or three 
times; or does it "depend upon circumstances"? And if so, 
upon what circumstances ? If a common basis for all departments 
is sought, some means of equating excellences and errors is essential. 
And since there can be no positive unit of excellence, any system of 
marking that implies positive units of excellence is foredoomed to fail. 

But it is possible to define relative standards of excellence with 
sufficient definiteness to serve the purpose of any school's marking 
system. Of two themes written as samples of a particular branch 
of English writing a competent instructor will seldom be at a loss 
to say which is the more successful; certainly if the choice is so 
difficult as to create doubt, a choice one way or the other cannot 
be said to constitute injustice. And so an instructor may, without 
substantial unfairness, rank a series of themes from best to worst; 
so, too, he may divide a series of themes into consecutive groups 
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according to any predetermined classification of marks and assign 
to each group its appropriate relative value. This ranking is 
obviously easy when all members of a class work at identical 
assignments; nor is it difficult to do substantial justice where 
individual assignments are made, for a competent instructor may 
be assumed to be able to appreciate relative difficulties to be dealt 
with quite as well as relative excellences in achievement. And 
to fix the rank of a student's work for a whole term it is necessary 
only to reduce the values of all marks given to each individual to 
some convenient form of simple average such as the arithmetic 
mean, the median, or the mode. And if, in all courses finding their 
constituencies among students approximately equally advanced in 
their academic work, virtually equal proportions of students be 
given each mark, the ranking of students thus obtained must fur- 
nish a dependable basis for such administrative action and such 
academic awards as the school bases upon marks given. 

The qualifications of the latter statement are a highly essential 
part of it. For it may be maintained (though it is not here asserted) 
that a better quality of work should be expected of a Senior taking 
some Freshman course than is expected of other members of the 
class; and this Senior may be marked on the basis of what a whole 
group of Seniors in such a course might be reasonably expected to 
accomplish. And if, as we find in universities and colleges, con- 
tinued membership in the school is conditioned upon maintaining 
some minimum average performance, it is to be desired that what- 
ever eHmination of weaker students is to take place should take 
place as early as possible. Because of this and because entering 
classes generally include a higher proportion of weaker students 
than those classes further advanced, it may be desirable to give 
a smaller proportion of the highest marks and a larger proportion 
of the lower marks to incoming classes. In public high schools 
this problem of elimination is at a minimum; and no violence to 
justice is done if within all class-groups virtually equal proportions 
of students be awarded any given mark. 

The other qualification, viz., that virtually equal proportions 
of students in all class-groups be given each mark, must not be 
taken to mean that any prescribed standard of proportioning must 
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be followed without respect to the number or personnel of the 
class-group. Class-groups may be good and bad relatively to 
others, just as individuals may. The smaller the characteristic 
registration for a section of a course the larger must these group 
variations be. If, in addition to the standard proportioning of num- 
bers to receive each mark, limits of variation from this standard 
sufficiently wide to accommodate differences between class-groups 
be set or, better still, if each student be marked with reference 
to some larger and more stable group of, say, 100 students, e.g., 
if a given mark is made to mean that the student receiving it 
would stand among the best or the second or the third tenth of 
the ideally large group, no substantial unfairness to the individual 
student need result from the accidental size or personnel of the class. 

No very extensive experience on the part of the instructor is 
necessary to enable to him to pass judgment upon a class as a whole 
as compared to others that he has had or that he may expect to 
have. Nor is it more difficult to estimate in nine of ten cases 
within what tenth of a group of one hundred the work of a given 
student stands. Few teachers of any considerable experience who 
have given thought to the problem will doubt that the fluctuation 
among small groups is large. Within the writer's own very brief 
experience there has been one class-group of thirty-one no member 
of which attained such an excellence of performance as to rank 
him among the best tenth of 100 students, and there has been 
another class-group of eighteen within which four students were 
clearly the best the writer has ever had. 

The writer has recently made a statistical study of all marks 
given during a period of five years in ten courses (two in each of 
five departments) in a very large school whose marking system is 
based upon implied positive standards. The final marks of 8,500 
students were included. Forty instructors had had more than 
100 students each in the groups included. A complete statement 
of the method followed and of the results would be premature at 
this time, but at some time in the near future, when the writer has 
made similar studies of marks given in other schools with different 
marking systems, the methods and results will be published. A 
few of the more significant conditions found may, however, serve 
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the purpose of illustrating the need of some definitely defined 
standard for the distribution of marks. 

Large differences in marking were found as between different 
departments. In one course out of 1,232 students only 20, or 
1 . 6 per cent, had received the highest mark in the range. In a 
course in another department out of 395 students 113, or 28.9 per 
cent, had received this highest mark. In the first of these two 
courses 37.7 per cent received marks below the average required 
for graduation. For the other course the corresponding percent- 
age was 10.4. These two courses draw their constituencies from 
students equally advanced. Administrative action and award of 
honors are based upon the marks in one course precisely as they 
are in the other. Can any given mark be reasonably supposed 
to represent the same relative excellence in both courses ? 

Wide differences are found as between different instructors 
giving the same course at the same time to equally advanced 
students. In one course one instructor with 343 students during 
the period gave no one the highest mark in the system. A col- 
league of his giving the same course during the same terms to 302 
students gave 1 1 . 6 per cent of them the highest mark. In another 
course in another department one instructor gave the highest mark 
to 47 out of 101 students. A colleague of his gave only 14 out 
of 108 as high a mark. In one course one instructor with 345 
students gave 22.2 per cent of them marks below the level required 
for graduation. A colleague offering the same course gave only 
1.6 per cent of his 302 students marks as low as this. These 
differences, though larger than the general run, are neither extreme 
nor uncommon. The particular comparisons are made only 
because they include fairly large and fairly equal numbers and 
because the classes were taught in the same terms. 

Within this group of courses any student, no matter what his 
ability (unless he were among the very best or the very worst), 
who had had the best of unofficial advice as to which instructors 
were the high markers could have averaged more than one mark 
higher for the ten courses (there are nine co-ordinate marks in the 
system) than he would have obtained had he taken the same courses 
with the lower marking instructors. He could have elected five 
courses with high-marking instructors that would have yielded 
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him, on the average, more than two marks higher on each course 
than he could have had on the other five courses with low-marking 
instructors. 

A number of other important matters which may be dealt with 
briefly at this time appeared in the course of this statistical study. 
One of these is that plus and minus marks in conjunction with the 
letter or number characters representing the grades of the system 
are likely to lead to much confusion and may, indeed, quite dis- 
rupt the workings of an otherwise good system. If "i+, i, and 
i — " are used as co-ordinate and separate marks teachers will be 
very likely largely to disregard one or two of them and regard a 
i— as a poor i, or a i+ as a good i and, in consequence, to return 
almost no i — 's, etc., i.e., to use only a part of the marking system. 
It is better to make the group classifications sufficiently small and 
to have a separate single character for each. 

Another of these smaller points is that numbers serve the pur- 
pose of designating relative standing rather better than letters. 
If there are to be eight marks in a system each standing for some 
particular i2§ per cent of the students a mark of 6 more readily 
conveys to the student that 25 per cent of the students rank below 
him and 625 per cent rank above him than does a mark of F; for 
everyone is f amiliar with the idea of the relativity of simple numbers. 

A third of these smaller matters is the tendency of some instruct- 
ors, if left to their own notions, to award the highest mark of a 
system only to those rare students who prove by the extraordinary 
excellence of their work that they are completely out of place 
in the course and who, for their own best interest, should be put 
in more advanced work where they will find something to struggle 
with. (This, quite obviously, does not apply to such courses in 
necessary sequence as elementary mathematics.) To reserve 
a mark for this purpose is very largely to use a marking system 
with one classification less than is intended. This must not be 
taken to mean that this upper class must not be made small; it 
may be extremely desirable to have a special high mark to mean 
that the student awarded it shows high promise of future excellence 
in the more advanced work of the subject. Used for this purpose 
the mark reports practically a positive standard, but none the 
less valid. 
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A fourth matter is the distinction that ought to be made between 
the mark that indicates failure and the marks that indicate suc- 
cessive degrees of excellence in passing work. Here is another 
quasi-positive mark. The number to be required to repeat a 
course either for credit or as a condition to more advanced work 
in the same subject ought to depend rather upon whether or not 
the student has proved himself able to do more advanced work 
than upon any number that it is estimated should be failed. The 
number that should be required to repeat can always be best 
determined by the department and by the individual teacher. 
The determination of failure is more a matter determining what 
shall constitute excellence of work than of determining relative 
rank of students. 

It is not intended here to advocate any particular proportion 
of marks, but an illustration may serve the purpose of showing 
what some particular distribution of marks will do. Let us 
suppose that our marking system for those who pass includes ten 
classifications, i, 2, 3, 4 ... . 10. Let 1 represent the best work 
and 10, work barely passing (failure being represented by o or by 
some letter). Let the best tenth be given the mark 1, the second 
tenth, 2, and so on. A mark of 1 or of 6 not only gives the student's 
rank with relation to others in his course but is strictly comparable 
with a like mark in any other course. A mark of 1 obviously 
means more in such a system than it could possibly mean if one 
teacher characteristically gave one- third of all students the highest 
mark in the system and another teacher as characteristically gave 
the mark to one-twentieth of his classes. 

The hypothetical system of distributing marks sketched above 
may be altered to suit the conditions of any school. If it seems 
desirable to make a special mark to indicate those who show great 
promise of future excellence in a department the proportioning of 
marks may be limited to those who fall outside both this class and 
the failure class. If fewer classifications seem desirable it is easy 
to change the proportioning. But the important matter is to 
decide first what is wanted of a marking system and only then 
to fix upon a proportioning that will bring this about. 



